LSC-Mine: Algorithm for Mining Local Outliers
نویسندگان
چکیده
Data objects which differ significantly from the remaining data objects are referred to as outliers. Density-based algorithms for mining outliers are very effective in detecting all forms of outliers, where data objects with fewer neighbors are likely to be outliers than are those with more neighbors. However, existing density-based algorithms engage in huge repetitive computation and comparison for every object before the few outliers are detected. Expensive computations might make scalability of these techniques to important applications like quick fraud detection unfeasible. This paper proposes LSC-Mine algorithm based on the distance of an object and those of its knearest neighbors. In addition, data objects that are not possible outlier candidates are pruned which reduces the number of computations and comparisons in LOF technique resulting in an improved performance.
منابع مشابه
Local multivariate outliers as geochemical anomaly halos indicators, a case study: Hamich area, Southern Khorasan, Iran
Anomaly recognition has always been a prominent subject in preliminary geochemical explorations. Among the regional geochemical data processing, there are a range of statistical and data mining techniques as well as different mapping methods, which serve as presentations of the outputs. The outlier’s values are of interest in the investigations where data are gathered under controlled condition...
متن کاملThe Spatial Outlier Mining Algorithm based on the KNN Graph
In order to solve the defect in the spatial outlier mining algorithm that the spatial objects may be affected by their surrounding abnormal neighbors, a Based K-Nearest Neighbor (BKNN) algorithm was proposed based on the working principle of KNN Graph, which could effectively identify the spatial outliers by using cutting edge strategies. The core idea of BKNN is to calculate the dissimilarity ...
متن کاملOutlier Detection in High Dimensional, Spatial and Sequential Data Sets
Of all the data mining techniques, outlier detection seems closest to the definition of “discovering nuggets of information” in large databases. When an outlier is detected, and determined to be genuine, it can provide insights, which can radically change our understanding of the underlying process. The purpose of the research underlying this thesis was to investigate and devise methods to mine...
متن کاملMining for Outliers in Sequential Databases
The mining of outliers (or anomaly detection) in large databases continues to remain an active area of research with many potential applications. Over the last several years many novel methods have been proposed to efficiently and accurately mine for outliers. In this paper we propose a unique approach to mine for sequential outliers using Probabilistic Suffix Trees (PST). The key insight that ...
متن کاملComparison of golden section search method and imperialist competitive algorithm for optimization cut-off grade- case study: Mine No. 1 of Golgohar
Optimization of the exploitation operation is one of the most important issues facing the mining engineers. Since several technical and economic parameters depend on the cut-off grade, optimization of this parameter is of particular importance. The aim of this optimization is to maximize the net present value (NPV). Since the objective function of this problem is non-linear, three methods can b...
متن کامل